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In a given market, financial covariances capture the intra- 
stock correlations and can be used to address statistically the 
bulk nature of the market as a complex system. We provide a 
statistical analysis of three SP500 covariances with evidence 
for raw tail distributions. We study the stability of these tails 
against reshuffling for the SP500 data and show that the co- 
variance with the strongest tails is robust, with a spectral 
density in remarkable agreement with random Levy matrix 
theory. We study the inverse participation ratio for the three 
covariances. The strong localization observed at both ends 
of the spectral density is analogous to the localization exhib- 
ited in the random Levy matrix ensemble. We discuss two 
competitive mechanisms responsible for the occurrence of an 
extensive and delocalized eigenvalue at the edge of the spec- 
trum: (a) the Levy character of the entries of the correla- 
tion matrix and (b) a sort of off-diagonal order induced by 
underlying inter-stock correlations, (b) can be destroyed by 
reshuffling, while (a) cannot. We show that the stocks with 
the largest scattering are the least susceptible to correlations, 
and likely candidates for the localized states. We introduce a 
simple model for price fluctuations which captures behavior 
of the SP500 covariances. It may be of importance for assets 
diversification. 



I. INTRODUCTION 

A number of phenomena in nature are characterized 
by a coexistence of different scales, usually described 
by power law distributions. This is the case of most 
phase transitions where at the critical point the corre- 
lation functions are scale invariant, as well as most fluid 
phases in highly developed turbulence where the velocity 
fluctuations are sensitive to a variety of eddies. Power 
law distributions are also encountered in a number of 
biophysical settings as well as financial mar kets 0. 

Stable random phenomena with power law behaviors 
are usually described by Levy distributions, a conse- 
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qucncc of the central limit theorem for scale-free pro- 
cesses. The simplest example is a random walk with 
a power law distribution for single independent steps, 
where the relative probabilities at different times are 
scale free. These phenomena lead to anomalous diffusion 
and intermittency as encountered in charge transport in 
amorphous semiconductors, moving interfaces in porous 
media, spin glasses, turbulence ||^ and phase changes in 
chiral QCD g. 

Recently, it was pointed out that current market co- 
variances are gaussian noise driven with possible con- 
sequences for the assessment of correlations in portfo- 
lio evolution and optimization [Q. In particular, it was 
shown that the lower part of eigenvalue distribution of 
the SP500 covariance matrix constructed from the daily 
returns normalized by the local volatility, is Gaussian 
noise dominated. In this paper we confirm some of these 
observations, but suggest that an alternative covariance 
constructed from the daily returns normalized to the ini- 
tial price displays Levy noise throughout the spectrum. 
The latter is more robust against reshuffling and certainly 
requires a random Levy matrix description. This obser- 
vation is overall consistent with a recent observation we 
made in the context of free random Levy matrices ^ . 

The outline of the paper is as follows: in section 2, 
we introduce the concept of financial correlation matri- 
ces, and empirically analyze their statistical content. We 
show that all covariances display power law tails, albeit 
with different indices. In section 3 we discuss the issue of 
inter-stock correlations and we define reshuffling of the 
price series and investigate its effects on the covariance 
matrices. We find that the covariance with the largest 
tails is the least sensitive to this process. This is dis- 
cussed in section 4, where we analyze the corresponding 
spectral densities and show that the results of random 
Levy matrices apply remarkably well to the covariance 
that is stable under reshuffling. In section 5, we analyze 
the bulk eigenvector content of the all covariances and 
parallel them with the results of reshuffling and Levy ran- 
dom matrix theory. The larger the tails, the stronger the 
localization seen in the participation ratios and the stock 
scattering. In section 6, we formulate a simple model 
of price fluctuations, which reproduces most of the ex- 
perimentally observed features of the SP500 covariances. 
Our conclusions are in section 7. 
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II. FINANCIAL COVARIANCES 

One of the central problems in financial investment 
is the assessment of risk. Standard lore suggests that 
risk can be reduced through assets diversification, with 
Markowitz's portfolio analysis as one of the corner-stones 
in assets allocation and diversification 0. The key to 
Markowitz's analysis is the concept of a covariance ma- 
trix. In this section we define and empirically analyze the 
distribution of entries and also correlations as captured 
in certain SP500 covariance matrices, with comparison to 
results from random Levy matrices. Throughout, we will 
use price return data from the SP500 daily quotations of 
N = 406 stocks over the period of T -I- 1 — 1309 days 
from 01.01.1991 tiU 06.03.1996 (ignoring dividends). 

A. SP500 covariances 

Consider the covariance matrix constructed from the 
raw returns normalized by the initial price: 

J t = l ^ t = l "J 

The raw returns mu of stock i (out of a total of N) at 
time t, labeled by an integer (t = 1, . . .T) are evaluated 
at fixed time intervals in a given market as 

mu = 6x,{t) ~ 6xi (2) 

where: Sxi(t) = Xi{t + I) — Xi{t). The mean dxi = 
J2t Sxi(t)/T is subtracted^ 

The choice of the normalization Mu — mu/xoi to the 
initial price preserves the nature of the tails and is scale 
invariant. The use of the relative returns instead of the 
logarithm of the ratio of the consecutive returns, is moti- 
vated by the additive rather than multiplicative character 
of the price series. 

In the following, we will argue that the C covariance 
matrix is able to reveal a fat-tail nature of the price 
change fluctuations, contrary to the covariances G and 
J, defined below. From the point of view of preserving 
power law tails, as a normalization one could alterna- 
tively use the prices, Xri, at any random time r. How- 
ever, from the calculational point of view involving the 
integrated return or the portfolio risk, the choice of the 
initial price is most convenient. 

Commonly, the following covariance is used in the anal- 
ysis: 



^In fact, all results presented in the paper would not change 
almost at all, and would not affect any conclusion, if the mean 
were not subtracted. 



^ 1 ^ rnurn^ 

where now the normalization is given by the volatility 
(variance) Ci with af = niu/T. In the presence of 
fat tails, i. e. if the distribution has a power-law behavior 
p{0 ~ a < 2, the variance itself has a fat tail 

distribution with an index a/2 and the average variance 
does not exist. Obviously, in this case the use of a as 
a normalization will bias the analysis. Alternatively, in 
place of (Ti, one may use the quantity — \mu\/T: 

For 1 < a < 2, Ti has a well defined large T limit. How- 
ever, in practice, for finite T, this normalization, similarly 
as the one for G, obscures the effects of large price-change 
fluctuations. 

A common feature of the quantities Mu = mu/xoi for 
C and Mu = mu/fJi, Mu = mu/fi for G and J, is that 
they are invariant with respect to change in the monetary 
unit. They exhibit, however, a different scale behavior for 
large ^. The raw cumulative probability P<{£,) {P>{0) 
defined as a probability that Mu is less (greater) than 
f , calculated for all i and t, is expected to pick up the 
smallest power of the tail, ^ ±A±^~"± (for gains/losses), 
present in the sample. In Fig. |l| we show the cumulative 
probabilities (a) P<(^) and (b) P>(0 for C. 




FIG. 1. (a) The cumulative distribution P<(C) for 
Mti — mti/xQt as in C and the best fit to the power law. 
(b) The same for P>(0- 

The data in the figure are compared with the power 
laws with: a_ = 1.78 and A_ = 2.4- 10~^ and a+ = 2.18 
A+ — 1.35 • 10^'^. The values are given without errors. 
The SP500 data set does not allow for an accurate de- 
termination of the fit parameters. The given values have 
a qualitative meaning. The fact that they are close to 2 
signals the presence of fat tails of the underlying distri- 
bution. Indeed, the presence of fat-tails will be confirmed 
by the following analysis of the covariance matrices. 

Repeating the same for G we find much thinner tails 
with the following exponents: a_ = 3.8 and a+ = 4.5, 
and for J: a_ = 3.5 and a+ = 4.3. 
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Clearly, the normalization to either the variance 
(volatility), Ci, or the range tends to affect the raw 
tail distributions, with a quenching towards the Gaus- 
sian distribution. This is expected, since the fluctuations 
are roughly normalized to the typical fluctuation. This 
is not the case in C, where the raw tail is retained. 

Let us introduce yet another covariance matrix which 
will be convenient in the further analysis. We will con- 
struct it from the signs Su = sgnmti: 

1 ^ 

Sij = ^ X/ ^ti^tj (5) 
t=l 

We use in our analysis a three-valued sign function: 
sgn = —1, 0, 1. For all assets the average 

(^.>-^E^*.«0 (6) 

and the successive entries in the historically ordered row 
Su are essentially uncorrelated. 

III. CORRELATIONS 

By construction, the financial covariance matrix is 
composed of intra-assets (here stocks) correlations, and 
therefore tells us how closely assets move in time-evolving 
market. The microscopic nature of these correlations is 
so far unknown. However, a quantitative understanding 
can still be achieved statistically. The source of the cor- 
relations is two-fold: real correlations between assets and 
statistical fluctuations. The statistical fluctuations dis- 
appear in the T — oo limit. For finite T, however, even 
in the absence of any real correlations, the non-diagonal 
entries are non-zero. In the gaussian universality they 
fall-off as 1/VT. 

As a measure of correlations between distinct compa- 
nies i j one can use a distribution p{Cij). Similar 
distributions can be constructed for G and S. 

Consider first correlations of pure signs. In Fig. ^a, we 
show the distribution (solid line) of S^ 's histogrammed 
over all pairs i ^ j. We observe a strong asymmetry to- 
wards positive correlations with a maximum around 0.1, 
indicating that assets have a tendency to move collec- 
tively in the same trend: up or down. 

This pronounced asymmetry is present in correlations 
for all other covariances C, J, G. We show in Fig. ^.b 
the distribution of correlations for C. The correlations 
between signs are inherited by all other covariances. We 
shall discuss possible consequences of this behavior in 
the section VI. The inter-stock (inter-sign) correlations 
can be easily destroyed by a procedure of reshuffling de- 
scribed below. Indeed, we see in Fig. ^ (dashed line) that 
after reshuffling the spectra become symmetric. 




FIG. 2. (a) The distribution of correlations of signs S be- 
fore (solid line) and after reshuffling (dashed line), (b) The 
same for C. 



A. Reshuffling 

Let us introduce the abovementioned procedure to re- 
move the inter-stock correlations from the data. Having 
done this, we will be able to concentrate on the issue of 
the stochastic nature of the price fluctuations. 

The price changes 6xi{t) enter the covariance matrix in 
the historical order. This order, in particular preserves 
inter-stock correlations. We can suppress the inter-stock 
correlations in the data, by introducing a random time 
ordering to the time history for each asset. More pre- 
cisely, for each asset, i, we can generate a random per- 
mutation of t-indices P*^*' -.1^1' = P^^^{t). and in- 
stead of the historical ordered rows of returns we can 
use: 5x'^{t) = 5xi{t'), to define: mj^ — Sx'i{t) — Sxi and 
the corresponding covariance matrices C, J' G', and S'. 
The permutations for different rows, are ran- 

dom and mutually independent. Such a reshuffling does 
not change the information content of the individual asset 
rows, because successive entries in the historical ordered 
row, 5xi{t) and 5xi{t -t- 1), are uncorrelated for typical 
time intervals on a market. Thus, the reshuffling affects 
only the inter-row information content destroying any 
correlations. Hence we expect that the reshuffled data 
set should reflect pure stochastic nature. Indeed, it does. 
In Fig. 1^ we show for example the effect of reshuffling on 
the inter-stock correlations. The asymmetry disappears. 

IV. SPECTRA 

In this section we discuss the spectral density associ- 
ated with the covariance matrices defined above. The 
spectral density plays an important role in risk assess- 
ment 

The results will be presented as histograms of eigen- 
values A. We will sort of unify the scale on the A-axis 
by plotting histograms as a function of a quantity A/F 
where F is defined as: 

F = — Tr C — V Ml , (7) 
TV NT /-^ ' ^ ' 

ti 
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and analogously for G, J and S. For a covariance 
of gaussian numbers, the constant F approaches a T- 
independent constant in the limit T ^ oo while for a 
covariance of power-law distributed numbers with 1 < 
a < 2, it behaves as F = -^y^/a-i^ where 7 is a T- 
independent constant. We will explain this scaling in 
more detail in the section about random Levy matrices. 
It is easy to see that by construction F = 1 for G and 
S. The histograms of eigenvalues A/F for the SP500 data 
are presented in Fig. ^. 




2 4 2 4 




2 4 2 4 
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FIG. 3. The eigenvalue histograms for CjG,J and S. For 
G and S we present also the gaussian fit mj. In the inlets 
we show the same histograms but in a range embracing all 
eigenvalues. To make the smallest picks visible we artificially 
enhanced them by setting in inlets the logarithmic scale on 
the vertical axis. Notice that except the one for C all plots 
have the same ranges. 

Let us make here two points: the histograms for G 
and S are almost identical. All spectra have a few large 
eigenvalues. In the next section using Random Gaussian 
Matrices (RGM) we will define the scale which will tell 
us which eigenvalues can be treated as large. 

There are potentially two sources of large eigenvalues 
in the spectrum: inter-stock correlations and fat-tails. In 
the next sections we will discuss methods to pinpoint the 
two effects. 

In the right block of Fig. ^ we plot the eigenvalue den- 
sity of the G' and S' covariances. The spectra for G' 
and S' are again almost identical. In comparison with 
the spectra for the historically ordered data set, right 
block of Fig. H, we see that the large eigenvalues disap- 
pear: the largest eigenvalue of G was 41.95, and for G' 
after reshuffling 2.77. Moreover, the G' and S' spectra 
fit very well to the curve: 



p{X) ^ — ^/(A — Xmin){Xmax — A) (8) 

with Xmin — 0.20, Xmax — 2.43, predicted by Random 
Gaussian Matrices (RGM) in the large N limit In 
general for the asymmetry parameter a = T/N, the for- 
mula predicts Xmax,min = (1 i l/v^)^, which in partic- 
ular for a = T/N = 1308/406 « 3.22, yields the values 
given above. These values are used in the curve plotted 
in Fig. ^. This agreement clearly indicates that the nor- 
malization to the volatility ct; used in G brings the signal 
to the Gaussian universality. 

It is worth noting that if one attempts to fit the RGM 
result to the eigenvalue histograms for a correlated data 
set one generally obtains values for Xmin, Xmax which 
deviate from the predicted ones. For example, for G and 
S for the historically ordered we get Xmax = 2.22, Xmin = 
0.11 and Xmax = 2.09, Xmin = 0.14, respectively, This 
deviation comes as a compensation for the appearance of 
large eigenvalues which lie far away and corresponds to 
asymmetry parameters a = 2.5 and a = 2.9, respectively. 

As shown in Fig. ^ the large eigenvalues survive reshuf- 
fling in C covariances. Moreover, the position of the 
larger eigenvalues is relatively stable under reshuffling. In 
fact, in each of 20 random reshufflings used in the plot, 
the three larger eigenvalues normalized by the scale F, 
always land in the same histogram bin of the size 0.015. 
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z=A,/r z=XlT 



FIG. 4. The same as in fig. ^ but for reshuffied data set. 
The histograms are averaged over 20 random reshufflings. 

The fact that there are large eigenvalues in the reshuf- 
fled spectrum can be attributed to the presence of heavy 
tails in the probability distributions for price fluctua- 
tions. The behavior of the spectra can be understood, 
as we will see below, in terms of Random Levy Matrix 
(RLM) theory. 
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A. C versus RLM 



B. Scaling in RLM 



We want to compare eigenvalue spectra of C and C 
with the ones obtained from an ensemble of Random 
Levy Matrices (RLM) We generate Levy matri- 

ces (RLM) as follows. We choose N x T elements of 
a matrix Mt^ as independent random numbers from a 
Levy distribution. We find the eigenvalues of the matrix 
Cij — We can repeat this many times 

collecting the eigenvalues in a common histogram. 

Here we would like to mention, that there exists an 
alternative construction of random matrix ensembles, 
based on the concept of free random variables [b|. We 
call this realization Free Levy Matrices (FLM) H. En- 
sembles of FLM, are, contrary to RLM, invariant under 
rotations. They are more easily tractable using analyt- 
ical methods. On the other side, due to the invariance 
of the measure, the eigenvectors do not show interesting 
correlations like in the case of RLM, which we discuss in 
the next sections. In the rest of this paper, we do not 
discuss FLM, and we refer for comparison between the 
FLM and RLM ensembles to |||. 

As an illustration, in our numerical experiment we gen- 
erated RLM with an asymmetry a = T/N = 3.22 ad- 
justed to the asymmetry of the SP500 data set being 
presently considered. The choice of the asymmetry will 
become clear in the next subsection. As the distribution 
for the matrix elements Mu we took a symmetric Levy 
distribution with a = 1.7 close to the value emerging 
from the analysis of the tail behaviour of the cumulative 
probability F<(0. 

In Fig. H we compare the cumulative distributions of 
eigenvalues for C and C, and the random matrix result 
for fixed asymmetry a — 3.22 and size N = 406. We 
see that the large eigenvalues in the spectrum C survive 
reshuffling. The spectrum of RLM and of the reshuffled 
SP500 data set exhibit similar large eigenvalue behaviour. 




O.I I 10 100 I 10 100 

A,/r A,/r 

FIG. 5. (a) Cumulative eigenvalue distribution for the 
SP500 data for the historically ordered data set compared 
with the one for reshuffled data and (b) with the one of a 
randomly generated Levy matrix. 



Assume that Ma are power-law distributed: p(^) ~ 
^-1-°. Define: 

C^.— ^E^^*^^*i (9) 
t 

with the normalization factor 1/T"' whose exponent a 
may differ from 1. We will argue that a natural candidate 
for a is a = 2/ a. 

Let us split C into a diagonal D and off-diagonal A 
parts 

C„- = T>Aj + A,j (10) 

We can use the Central Limit Theorem for Levy univer- 
sality to obtain the distribution of the entries in C in the 
large T limit. We get 

D, ^ T^/"— , A,, ~ ri/"-"a,, , (11) 

where di and aij are T independent constants, dis- 
tributed with the stable Levy distributions. For the di- 
agonal elements di we expect a distribution with an in- 
dex a/2 and the skewness parameter (3—1, while the 
off-diagonal elements will be distributed with a sym- 
metric distribution with an index a. To assess the im- 
portance of the off-diagonal entries on the spectrum, we 
use the standard perturbation theory. For that, we write 

C„- = T2/"-c,j = T2/"- (dAj + T-i/"ay) . (12) 

and expand Cij = diSij + eaij in e = l/T^/". In ze- 
roth order, the eigenvalues of are just di. The first 
order corrections are zero because the matrix Ay is off- 
diagonal. Generically, for a random matrix, d^'s are not 
degenerate, so up to the second order, the eigenvalues of 

(13) 

There are — 1 terms in the sum, each of order unity. 
Thus the sum contributes a factor proportional to N, say 
« SiN, and we have: 

A,; = + SiNT-^^" . (14) 

The off-diagonal terms compete with the diagonal ones 
for N « T^/". In our case, a = 1.7, A^/T^/" « 1/3. 
The range of the spectrum of C will not grow with T for 
cr = 2/a. 

The normalization constant F — {TtC)/N, which we 
have introduced previously for the experimental covari- 
ances, behaves for RLM with l<Q;<2asr = ^T^/""''. 
Again it simplifies for the choice a — 2/a. 
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V. STATES 

In this section we analyze the eigenvector content of 
the three covariances using the inverse participation ra- 
tios and the stock scattering. We show that the covari- 
ances with larger tails are more stable under reshuffling 
the SP500 data, with localized states at the edge of the 
spectrum. 

A. Inverse Participation Ratios 

To better understand the nature of large-eigenvalues in 
the SP500 data, we now turn to the eigenvector content. 
For that we use the inverse participation ratio 

N 

Ya = E ^A. (15) 
where V\ is a normalized |^ eigenvector: 

N 

1=1 

of C to the eigenvalue A. We can distinguish the 'mixed' 
states with Y\ « 1/iV « and 'pure' states with Y\ « 1. 

In Figs. ^ we display the inverse participation ratios 
for the three covariances C, G and J for the raw SP500 
data (triangles) and reshuffled SP500 data (pluses). Ad- 
ditionally, in the fourth insert we compare the inverse 
participation ratio for C' (triangles) and for RLM with 
index a — 1.5. The large eigenvalues are localized for J 
and C with intermediate and large participation ratio, 
respectively. For G and J the largest eigenvalue states 
are 'mixed' while for C they are 'pure'. Large eigenvalue 
pure states are present also in the C covariance ! This 
is clearly displayed by the data. The inverse participa- 
tion ratio as a function of eigenvalue for C' has the same 
character as RLM. 

The Levy randomness has an equally strong effect on 
the large part of the distribution as the inter-stock cor- 
relations. Also, in the G and J covariances reshuffling 
'breaks' the clustering, removing from the spectrum large 
eigenvalues. 

Concerning the lower part of the spectrum, it is inter- 
esting to note that it is characterized by smaller partici- 
pation ratios than the large eigenvalue part. The excep- 
tion is G for which the part with small eigenvalues has 
larger inverse participation ratio. On the other hand, the 
low eigenvalue parts of the G and C show a similar be- 
havior. In fact, one expects that the shape of the lower 



^This implicitly assumes that the entries Vxi are at least 
power law distributed with index a > 1. 



part of the spectra for RLM and RGM to be strongly 
related to the asymmetry of the matrices rather than to 
the type of randomness. Indeed, for matrices with asym- 
metry a < 1, the spectra exhibit exact zero eigenvalue 
states (zero modes). 
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FIG. 6. Inverse participation ratios for for the SP500 co- 
variances, for the historically ordered data (crosses) and for 
the reshuffled data (triangles). In the lower right figure we 
compare the distribution of the reshuffled data (triangles) and 
of Random Levy Matrix (circles). 

B. Stock scattering 

In analogy to the inverse participation ratio we define 
a quantity: 

A 

which measures how many eigenstates are mixed in a 
pure stock state i. We will refer to it as the stock scat- 
tering. Again we have the normalization: 

E ^A, = 1 (18) 

A 

The inverse 1/Pi tells us how many eigenstates are influ- 
enced by the stock i. 

In Fig. ^ we show the value of Pi for the consecutive 
406 stocks (horizontal) with < Pi < 1 (vertical), for 
each of the three covariances discussed in this work, at 
the same scale to facilitate the comparison. Clearly, the 
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normalization to unit volatility drives Pi towards gaus- 
sian noise. We have checked that the effects of reshuffling 
is to enhance the localization of certain stocks (without 
much affecting the original ones), in agreement with the 
participation ratio analysis. The stock scattering of C 
provides a relatively simple filter for those stocks that 
localize in a market, and are likely to drive the large tail 
behavior of the covariance matrix. 
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FIG. 7. Stock scattering for the SP500 covariances for 
the historically ordered data (lines) and the reshuffled data 
(dots). 



VI. MODEL 

Collecting all experimental evidence for the SP500 
data we are led to the conjecture, that the returns nor- 
malized to the initial price Mti = mu/xoi undergo fluc- 
tuations which can be well described by a randomness of 
the type: 

Mu = sgn,, • (19) 

where sgn^^ is a random matrix of correlated signs and 
are identical independent distributed numbers from the 
Levy universality. Indeed, as shown in Fig. |^ the sign 
correlations are present in all covariances. Additionally, 
one can check that the substitution of signs of dxi{t) by 
random signs has the same effect on spectra of the SP500 
covariances as reshuffling. 

Moreover, the spectra of eigenvalues of G and S are 
almost identical and can be seen from the Fig. ^ and 
from the comparison of a few largest eigenvalues which 
are 6.89, 7.27 and 41.95 for G and 5.48, 7.43 and 43.25 
for S. This tells us that the information about the inter- 
stock correlations present in G is already present in S. 
Thus, as long as the long tails are suppressed, as in G, 
the absolute value of the fluctuations docs not matter. 

The absolute value of the changes matters, however, 
if we do not introduce any superfluous normalizations 



and expose the covariance[| to large price fluctuations as 
in C. In this case, the correlations of signs play a sec- 
ondary role, since even when one decorrelates them by 
reshuffling, the large eigenvalues stay in the spectrum. 

Similarly as under reshuffling, the spectrum of C does 
not change its character and the largest eigenvalues are 
quite stable, if one uses randomly generated signs in- 
steads of those inherited from the historical data. 

The randomness given by the formula ( p^ ) captures 
many experimentally observed features of the real data. 
It should be treated, however, as as zeroth order approx- 
imation. In a more involved analysis, one should intro- 
duce some corrections to the conjecture which for exam- 
ple can take into account the possibility of correlations 
between sign and absolute value of the price changes. In- 
deed, the data show the existence of such correlations. 

VII. CONCLUSIONS 

We have carried an empirical analysis of the covari- 
ance matrices characterizing daily price returns from the 
SP500 market. We have shown that a specific covariance 
(returns normalized to the stock initial price) exhibits 
matrix entries with almost stable Levy tails. A com- 
parative study shows that only this covariance is stable 
under reshuffling, with a spectrum in remarkable agree- 
ment with the one extracted from an ensemble of ran- 
dom Levy matrices with commensurate sizes and asym- 
metry. An analysis of the corresponding participation 
ratio shows large localized and almost 'pure' states. This 
is not the case of the other covariances (returns normal- 
ized to the stock mean mean volatility or range), which 
are characterized by 'mixed' states with one character- 
istically large and delocalized eigenvalue reminiscent of 
Yang's ODLRO |l^ . The stock content of the localized 
states is best displayed using the stock scattering. 

In nearly gaussian markets, the risk is usually assessed 
by minimizing the variance of a pertinent market policy, 
say an investment portfolio, using the empirical market 
covariance as suggested by Markowitz Recently, it 
was pointed out that the low-lying eigenvalues of the em- 
pirical market covariance are gaussian noise dominated 
(information free), implying that standard Markowitz 's 
theory for risk assessment is flawed [Q. In non-gaussian 
markets, the potential for large asset fluctuations may re- 
quire using an alternative to Markowitz 's theory through 
the use of value-at-risk or tail-covariance ||l|, each of 
which requiring the covariance matrix. 



■^One can define other covariances, like for example, the one 
constructed from instantaneous returns: Ma = mu/xu, or 
Mti = logSxiit + l)/5xi{t), which similarly to C, would 
preserve information about the power-law tails. As a conse- 
quence, for instance, their spectra would behave in the same 
way under the reshuffiing as the one of C. 
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In the present work we have shown that eigenvalues of 
a market covariance follow the theoretical distribution of 
eigenvalues of almost randomly generated Levy matrices. 
The empirical market covariance reflects on a state of 
maximum entropy in the generalized sense of Dyson for 
random Levy matrices. Our observations maybe relevant 
for assets diversification and risk management. 
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